Abstract

We present a study of Higgs production at the LHC via Weak Boson Fusion,
with the Higgs boson decaying into a $b\bar{b}$ pair. A detailed partonic LO calculation of
all the potential backgrounds is performed. We conclude that this channel for Higgs
production can be extracted from the backgrounds, and present our estimates of
the accuracy in the determination of the $Hbb$ Yukawa coupling.

1 Introduction



A Higgs boson in the so-called low-mass region ($115 < m_H\,(\mathrm{GeV}) < 140$) decays
predominantly into $b\bar{b}$ final states. Due to the large inclusive QCD backgrounds, detection of
this decay is however extremely challenging. In particular, the extraction of the most
copious signal, namely inclusive $gg \to H \to b\bar{b}$ production, has never been shown to be
viable. The only production channels which have so far been proven to be suitable for a
determination of the $Hbb$ coupling are the associated production channels $Ht\bar{t}$ and $HW$ [1, 2].

In this note we document a study of the $H \to b\bar{b}$ decay in the electroweak boson
fusion (WBF) production channel and of its backgrounds, and we discuss the potential
of this process for the determination of the $y_{Hbb}$ Yukawa coupling. The signal rate is
proportional to the product of the $g_{HVV}$ coupling, where $V$ denotes a weak $W$ or $Z$
boson, times the $B(H \to b\bar{b})$ branching ratio. The contamination of the signal coming
from QCD production of a Higgs plus two jets (mediated by a loop of virtual top quarks)
is not included in this analysis. Following the study of ref. [3], this contribution is suppressed
by the particular set of kinematical cuts chosen in our analysis (see Section 2).

The results obtained are based on a leading order partonic calculation of the matrix
elements (ME) describing signal and background processes. The latter include the
following channels: QCD $b\bar{b}jj$ production, $Z(\to b\bar{b})jj$, $W/Z(\to jj)b\bar{b}$, $t\bar{t} \to b\bar{b} + \mathrm{jets}$, QCD
four-jet production (where two light jets are misidentified as originating from $b$ quarks), and
contributions from multiple overlapping events.

We identify a set of kinematical cuts leading to signal significances in the range of
2-5$\sigma$, depending on the Higgs mass. In the lowest mass region, this provides a determination
of the $B(H \to b\bar{b})$ branching ratio with a precision of the order of 20%. The $H \to b\bar{b}$
decay in the WBF channel could be used together with other processes already examined
in the literature for a model independent determination of the ratio of Yukawa couplings
$y_{Hbb}/y_{H\tau\tau}$ [4]. We therefore conclude that the $H \to b\bar{b}$ channel produced in association
with two jets should be considered as an additional channel to be exploited for interesting
measurements of the Higgs couplings to fermions.

This letter is organized as follows. In Section 2 we describe the kinematical constraints
introduced to perform the event selection. Section 3 is devoted to the discussion of signal
and backgrounds, while the signal significance and the accuracy of the determination of the
$H \to b\bar{b}$ branching ratio and Yukawa coupling are presented in Section 4. In the Conclusions
we summarise and discuss our final results.

2 Event selection 

The choice of selection criteria is guided by two main requirements: the optimization of
the signal significance ($S/\sqrt{B}$), and the compatibility with trigger and data acquisition
constraints. The main features of the signal, to be exploited in the event selection, are:
the presence of two high-$p_T$ $b$ jets, showing an invariant-mass peak; and the presence of a pair of jets
in the forward and backward rapidity regions. In principle such a signal could also exhibit
rapidity gaps, due to the colour-singlet exchange of EW bosons among the incoming

Figure 1: The $p_T^j$ distributions are shown: high-$p_T^j$ regions are more suppressed
in the $b\bar{b}jj$ QCD background (solid) with respect to the signal (dashes). The
inclusive distributions shown are normalised to the same cross section.

hadrons; this fact has been used recently in [5]. Because of the high luminosity (and the 
large number of overlapping events) required to study this final state, and because of the 
large emission rate for extra jets in WBF processes (see [6]), we do not feel comfortable 
with applying this additional constraint in our study. 

The tagging of the $b$ jets is only possible in the central region $|\eta_b| < 2.5$. The efficiency
of the tagging algorithm, furthermore, suggests using a $p_T$ cut as large as possible. Since
the measurement of the Higgs boson in this channel will take place only after its discovery
and the determination of its mass, we can optimize the mass requirement by selecting only
$b$ pairs in a mass window centred around the known value of $m_H$, up to the dijet mass
resolution. These considerations lead to the following set of cuts:

$p_T^b > 30$ GeV (1)

$|\eta_b| < 2.5$ (2)

$\Delta R_{bb} > 0.7$ (3)

$|m_{bb} - m_H| < \delta_m \cdot m_H$, (4)

$\delta_m$ being the experimental resolution, $\delta_m \sim 12\%$. Given the very small width of the Higgs
boson in the mass range we shall consider ($m_H < 140$ GeV), this last requirement reduces
the signal to 68% of what would be obtained with perfect mass resolution. In the following we
shall assume a $b$-tagging efficiency $\epsilon_b = 0.5$. While harder cuts on $p_T^b$ would improve the
$S/B$ ratio, they would also risk sculpting the mass distribution, setting a higher value for

Figure 2: The distribution for $m_{jj}$ is shown both for the signal (dashes) and
for the $b\bar{b}jj$ QCD background (solid). The inclusive distributions shown are
normalised to the same cross section.

the dijet mass threshold and therefore making it harder to extract the background shape 
directly from the data. 
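
The 68% figure quoted for cut (4) is what one expects if the reconstructed dijet mass is Gaussian-distributed around $m_H$ with width $\delta_m m_H$ and the intrinsic Higgs width is neglected. A minimal Python sketch under that assumption follows; the function name `window_acceptance` is ours and purely illustrative.

```python
import math

def window_acceptance(n_sigma: float = 1.0) -> float:
    """Fraction of a Gaussian mass peak inside |m_bb - m_H| < n_sigma * sigma.

    Assumes a purely Gaussian detector response and a negligible Higgs width.
    """
    return math.erf(n_sigma / math.sqrt(2.0))

m_H = 120.0                  # GeV, one of the mass points considered in the text
delta_m = 0.12               # relative dijet mass resolution quoted in the text
half_width = delta_m * m_H   # half-width of the mass window of eq. (4), in GeV

print(f"mass window: {m_H - half_width:.1f} - {m_H + half_width:.1f} GeV")
print(f"signal acceptance: {window_acceptance():.3f}")
```

For $m_H = 120$ GeV the full window is about 29 GeV wide, in line with the roughly 30 GeV width used in Section 3.1.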

The large momentum exchange required for the emission of the space-like gauge bosons
will lead to a hard $p_T^j$ spectrum for the forward and backward light jets. This is clearly
shown in Fig. 1$^3$, where we see that the jet $p_T$ peaks at approximately 30 GeV. The
spectrum of typical QCD backgrounds will instead peak at low $p_T$. The large momentum
of the forward jets, and their large rapidity separation, favour large dijet invariant masses,
as can be seen from Fig. 2. The cuts we select for the two jets are:

$p_T^j > 60$ or $80$ GeV (5)

$|\eta_{j_1} - \eta_{j_2}| > 4.2$ (6)

$\Delta R_{jj}, \Delta R_{jb} > 0.7$ (7)

$m_{jj} > 1000$ GeV. (8)

The large $p_T^j$ cut is driven by the requirement that trigger rates be kept at acceptable
levels (see later). We present the two cases of 60 and 80 GeV to display the sensitivity to
this threshold. A final choice will presumably only be possible with a complete detector
simulation, or once background data are available. As we will comment later,
the cut on $p_T^j$ above 80 GeV is also very efficient in decreasing the backgrounds due to

$^3$ The distributions shown in the first two figures are obtained by applying no cuts to the signal, and
the following minimal cuts on the background: $p_T^j > 20$ GeV, $|\eta| < 5$, $\Delta R_{jj}, \Delta R_{jb} > 0.2$.

multiple overlapping events. The large mass cut is selected to reduce as much as possible
the QCD jet backgrounds. This cut, in addition to the rapidity cut, is also efficient in
removing the contamination from the process $gg \to Hgg$, as shown in ref. [3].

In addition to the above cuts, we shall consider two alternative selection criteria for
the light-jet rapidities, labelled (a) and (b). Case (a) is given by:

$2.5 < |\eta_j| < 5, \quad \eta_{j_1} \eta_{j_2} < 0$, (9)

while for case (b) we only have the condition:

$|\eta_j| < 5$. (10)

In case (b) we verified that requiring $m_{jj} > 1000$ GeV forces the product $\eta_{j_1} \cdot \eta_{j_2}$ to be
negative for the largest fraction of the events.

By inspection of the differential distributions for the variable $\Delta R_{bb}$ we find that requiring
$\Delta R_{bb} < 2$ for configuration (a) gives an additional enhancement of the signal with
respect to the backgrounds.
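
The forward-jet requirements of eqs. (5)-(10) can be sketched as a simple selection function. This is an illustrative sketch only, not the code used for the results in this paper: jets are treated as massless, only the $\Delta R_{jj}$ part of eq. (7) is checked (the $\Delta R_{jb}$ part needs the $b$ jets), and the names `Jet` and `passes_forward_cuts` are hypothetical.

```python
import math
from dataclasses import dataclass

@dataclass
class Jet:
    pt: float   # transverse momentum in GeV
    eta: float  # pseudorapidity
    phi: float  # azimuthal angle in radians

def delta_phi(a: Jet, b: Jet) -> float:
    """Azimuthal separation wrapped into [0, pi]."""
    d = abs(a.phi - b.phi) % (2.0 * math.pi)
    return 2.0 * math.pi - d if d > math.pi else d

def delta_r(a: Jet, b: Jet) -> float:
    """Separation in the (eta, phi) plane."""
    return math.hypot(a.eta - b.eta, delta_phi(a, b))

def dijet_mass(a: Jet, b: Jet) -> float:
    """Invariant mass of two massless jets, in GeV."""
    m2 = 2.0 * a.pt * b.pt * (math.cosh(a.eta - b.eta) - math.cos(delta_phi(a, b)))
    return math.sqrt(max(m2, 0.0))

def passes_forward_cuts(j1: Jet, j2: Jet, pt_min: float = 60.0, config: str = "a") -> bool:
    common = (
        min(j1.pt, j2.pt) > pt_min          # eq. (5)
        and abs(j1.eta - j2.eta) > 4.2      # eq. (6)
        and delta_r(j1, j2) > 0.7           # eq. (7), jj part only
        and dijet_mass(j1, j2) > 1000.0     # eq. (8)
    )
    if config == "a":                       # eq. (9)
        forward = all(2.5 < abs(j.eta) < 5.0 for j in (j1, j2))
        return common and forward and j1.eta * j2.eta < 0.0
    return common and all(abs(j.eta) < 5.0 for j in (j1, j2))  # eq. (10)

# Two back-to-back 100 GeV jets in opposite hemispheres.
tag_a = (Jet(100.0, 2.6, 0.0), Jet(100.0, -2.6, math.pi))
# Same, but one jet is more central than configuration (a) allows.
tag_b = (Jet(100.0, 2.0, 0.0), Jet(100.0, -3.0, math.pi))

print(passes_forward_cuts(*tag_a, config="a"), passes_forward_cuts(*tag_b, config="a"))
print(passes_forward_cuts(*tag_b, config="b"))
```

The second pair illustrates the difference between the two configurations: it satisfies the looser rapidity requirement of case (b) but not the forward-region requirement of case (a).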

3 The study of signal and backgrounds 

The background sources we considered include: 

1. QCD production of $b\bar{b}jj$ final states, where $j$ indicates a jet originating from a light
quark ($u$, $d$, $s$, $c$) or a gluon;

2. QCD production of $jjjj$ final states;

3. Associated production of $Z^*/\gamma^* \to b\bar{b}$ and light jets, where the invariant mass of the
$b\bar{b}$ pair is in the Higgs signal region either because of imperfect mass resolution, or
because of the high-mass tail of the intermediate vector boson;

4. $t\bar{t}$ production;

5. $t\bar{t}j$ production;

6. $b\bar{b}jj$ and $jjjj$ production via overlapping events.

The cases with 4 light-jet events are considered since the experimental resolution leads,
for any tagging algorithm, to a finite probability of $b$ tags in light jets (fake tags). We
shall label light jets mistagged as $b$ jets with the notation $j_b$, and assume two possible
values of the fake tagging efficiency $\epsilon_{fake}$, 1% and 5%. While the first choice is probably
optimistic, given the presence of real secondary vertices in jets containing a charm quark,
the second is likely to be too conservative. As we shall see, however, the requirement of
tagging both $b$ jets renders in any case the backgrounds with real $b$ quarks the dominant
ones.

The calculation of signal and background events is based on the numerical iterative
procedure ALPHA [7], as implemented in the library of MC codes ALPGEN [6]. While
ALPGEN allows for the full showering of the final states, both in the case of signals and
backgrounds, all our calculations are limited to the parton level. This is because a realistic
estimate of the rates would anyway require a full detector simulation, which is beyond
the scope of this paper.



Table 1: Signal and background events for configuration (a), with $p_T^j > 60$ GeV,
for three possible values of the Higgs mass. $Q^2 = \langle p_T^2 \rangle$. The $jjjj$ entry includes
the squared $b$-mistagging efficiency ($\epsilon_{fake} = 0.01$). The first row relative to the
$Z^*/\gamma^*$ contribution refers to the effect of the physical mass tail, while the second
row refers to the finite experimental $Z$ mass resolution ($\delta m_Z/m_Z = 0.12$). The
integrated luminosity is 600 fb$^{-1}$. The PDF set used is CTEQ4L. See the text for
the description of other, smaller, backgrounds.

$m_H$                                115 GeV             120 GeV             140 GeV
Signal                               $3.0 \times 10^3$   $2.8 \times 10^3$   $1.1 \times 10^3$
$b\bar{b}jj$                         $8.6 \times 10^5$   $8.0 \times 10^5$   $5.7 \times 10^5$
$j_b j_b jj$                         $6.4 \times 10^3$   $6.1 \times 10^3$   $4.1 \times 10^3$
$(Z^*/\gamma^* \to b\bar{b})jj$      $5.5 \times 10^2$   $3.8 \times 10^2$   $1.0 \times 10^2$
$(Z \to b\bar{b})_{res}\,jj$         $1.3 \times 10^3$   $6.8 \times 10^2$   $1.1 \times 10^1$
$j_b j \oplus j_b j$                 $7.5 \times 10^3$   $7.9 \times 10^3$   $9.0 \times 10^3$



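Table 2: Same as Table 1, for configuration (b).

$m_H$                                115 GeV             120 GeV             140 GeV
Signal                               $1.3 \times 10^4$   $1.2 \times 10^4$   $6.2 \times 10^3$
$b\bar{b}jj$                         $6.0 \times 10^6$   $5.3 \times 10^6$   $4.7 \times 10^6$
$j_b j_b jj$                         $1.2 \times 10^5$   $1.1 \times 10^5$   $1.1 \times 10^5$
$(Z^*/\gamma^* \to b\bar{b})jj$      $4.5 \times 10^3$   $2.8 \times 10^3$   $1.1 \times 10^3$
$(Z \to b\bar{b})_{res}\,jj$         $1.6 \times 10^4$   $8.3 \times 10^3$   $7.7 \times 10^2$
$j_b j \oplus j_b j$                 $1.8 \times 10^4$   $1.9 \times 10^4$   $2.3 \times 10^4$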



The event rates are obtained using the parametrization of parton densities CTEQ4L. 
Given the overall uncertainties of the background estimates, the results are not sensitive 
to this choice. The renormalization and factorization scales have been chosen equal (Q). 

Table 3: Same as Table 1, with $p_T^j > 80$ GeV.

$m_H$                                115 GeV             120 GeV             140 GeV
Signal                               $1.3 \times 10^3$   $1.2 \times 10^3$   $5.2 \times 10^2$
$b\bar{b}jj$                         $2.4 \times 10^5$   $2.3 \times 10^5$   $1.9 \times 10^5$
$j_b j_b jj$                         $2.6 \times 10^3$   $2.3 \times 10^3$   $1.8 \times 10^3$
$(Z^*/\gamma^* \to b\bar{b})jj$      $1.1 \times 10^2$   $6.6 \times 10^1$   $1.3 \times 10^1$
$(Z \to b\bar{b})_{res}\,jj$         $6.2 \times 10^2$   $3.4 \times 10^2$   $0.5 \times 10^1$
$j_b j \oplus j_b j$                 $2.9 \times 10^2$   $3.2 \times 10^2$   $4.5 \times 10^2$



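Table 4: Same as Table 3, for configuration (b).

$m_H$                                115 GeV             120 GeV             140 GeV
Signal                               $6.5 \times 10^3$   $6.4 \times 10^3$   $3.1 \times 10^3$
$b\bar{b}jj$                         $2.8 \times 10^6$   $2.2 \times 10^6$   $2.1 \times 10^6$
$j_b j_b jj$                         $5.6 \times 10^4$   $5.3 \times 10^4$   $5.2 \times 10^4$
$(Z^*/\gamma^* \to b\bar{b})jj$      $3.0 \times 10^3$   $1.9 \times 10^3$   $7.5 \times 10^2$
$(Z \to b\bar{b})_{res}\,jj$         $1.1 \times 10^4$   $6.0 \times 10^3$   $5.6 \times 10^2$
$j_b j \oplus j_b j$                 $1.1 \times 10^4$   $1.2 \times 10^4$   $1.6 \times 10^4$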



In order to be conservative in the background estimates, we selected as a default for
our study a rather low scale, namely $Q^2 = \langle p_T^2 \rangle$, where the average is taken over all
light and $b$ jets in the event$^4$. In view of the large $\hat{s}$ values of the elementary processes
involved, due in particular to the large mass threshold for the pair of forward jets, we
believe that our background rates may be overestimated by a factor of at least 2. In spite
of this we preferred the conservative approach, in order to present a worst-case scenario.
The backgrounds are much more sensitive to the scale choice than the signal, due to the
larger power of $\alpha_s$. The background uncertainty will not however be a limitation to the
experimental search, since the background rate should be determined directly from the
data, as we shall discuss.

Tables 1-4 present our results for signal and backgrounds, for the following cases: (i)
$p_T^j > 60$ GeV and rapidity configuration (a); (ii) $p_T^j > 60$ GeV and rapidity configuration
(b); (iii) $p_T^j > 80$ GeV and rapidity configuration (a); (iv) $p_T^j > 80$ GeV and rapidity
configuration (b). The numbers correspond to 600 fb$^{-1}$ of integrated luminosity, namely
the expected value for three years of running of ATLAS and CMS with an instantaneous
luminosity of $10^{34}$ cm$^{-2}$ s$^{-1}$. The numbers relative to final states with mistagged jets

$^4$ We also repeated our analyses with $Q^2 = m_H^2$, finding comparable results.

include the square of the mistagging probability $\epsilon_{fake} = 0.01$.

We shall now discuss each individual background contribution in detail. 

3.1 Single-interaction events

The 4-jet backgrounds originating from a single hard collision are shown in the second and
third rows of Tables 1-4. In the case of the $j_b j_b jj$ background, we accept all events in which
at least one pair of light jets passes the cuts in eqs. (1)-(4), and the other two jets satisfy
eqs. (5)-(8), in addition to the appropriate rapidity cut (eq. (9) or (10)). As anticipated,
the contribution from real $b$ jets is the dominant one, even assuming $\epsilon_{fake} = 0.05$.
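
Since the tabulated mistagged rates include the square of the mistagging probability, moving from 1% to 5% rescales them by a factor of 25. The short Python sketch below applies this to the $m_H = 115$ GeV entries of Table 1; the helper `rescale_fake` is an illustrative name, not part of any generator.

```python
def rescale_fake(n_events: float, n_fake_tags: int, eps_old: float, eps_new: float) -> float:
    """Rescale an event count that contains eps_old**n_fake_tags to a new mistag rate."""
    return n_events * (eps_new / eps_old) ** n_fake_tags

# Table 1, m_H = 115 GeV, configuration (a), p_T^j > 60 GeV
bbjj = 8.6e5           # real b jets: unaffected by the mistag rate
jbjbjj_1pct = 6.4e3    # two fake tags, quoted for a 1% mistag rate

jbjbjj_5pct = rescale_fake(jbjbjj_1pct, n_fake_tags=2, eps_old=0.01, eps_new=0.05)
print(f"j_b j_b jj at 5% mistag: {jbjbjj_5pct:.2e} events, bbjj: {bbjj:.2e} events")
```

Even with the pessimistic 5% choice the rescaled $j_b j_b jj$ count stays below the $b\bar{b}jj$ one, which is the statement made in the text.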

From the numbers in Tables 5 and 6, we see that $S/\sqrt{B}$ can be as large as 5.
However, the ratio $S/B$ is only a fraction of a percent. This implies that the background
itself will have to be known with accuracies at the permille level. There is no way that
this precision can be obtained from theoretical calculations. The background should
therefore be determined entirely from the data. We expect our kinematical thresholds to
be low enough not to sculpt the shape of the $b\bar{b}$ mass distribution at masses close to the
Higgs mass. This is true for the leading 4-jet backgrounds, as shown in Fig. 3. The $b\bar{b}$
invariant mass of the simulated $b\bar{b}jj$ background is shown here to be well behaved in the
[100, 150] GeV region. The distribution in the case of the $j_b j_b jj$ final states is similar.
As a result, we expect that the sidebands of the Higgs signal (the regions of mass below
$m_H(1 - \delta_m)$ and above $m_H(1 + \delta_m)$) can be safely interpolated in the region under the
Higgs peak, similarly to what was done by UA2 in the extraction of the $W/Z \to jj$
decay [8].
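
The two figures of merit discussed above can be reproduced approximately from the tabulated event counts. The sketch below sums only the background rows printed in Tables 1 and 2 for $m_H = 115$ GeV, so it is not expected to match Table 5 digit by digit (smaller backgrounds and rounding are ignored); the variable names are illustrative.

```python
import math

# Event counts for m_H = 115 GeV and p_T^j > 60 GeV, copied from Tables 1 and 2.
signal = {"a": 3.0e3, "b": 1.3e4}
backgrounds = {
    "a": [8.6e5, 6.4e3, 5.5e2, 1.3e3, 7.5e3],
    "b": [6.0e6, 1.2e5, 4.5e3, 1.6e4, 1.8e4],
}

results = {}
for cfg in ("a", "b"):
    b_tot = sum(backgrounds[cfg])
    s_over_b = signal[cfg] / b_tot
    s_over_sqrt_b = signal[cfg] / math.sqrt(b_tot)
    results[cfg] = (s_over_b, s_over_sqrt_b)
    print(f"config ({cfg}): S/B = {100.0 * s_over_b:.2f}%, S/sqrt(B) = {s_over_sqrt_b:.1f}")
```

The $S/B$ values come out at a few permille, which is why the background normalisation must be known at the permille level for the excess to be meaningful.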

For this extraction to be possible, however, full background samples have to be
collected. The large rate of untagged $jjjj$ events could therefore give problems with the
triggers and with the data acquisition. This is because the $b$-tagging algorithm is
typically applied only offline, and therefore a number of untagged $jjjj$ events larger than
what is acceptable by the trigger and by the data acquisition would force higher cuts, or
a trigger prescaling, strongly reducing the number of recorded signal events. Removing
the fake-tagging probability from the numbers in Tables 1-4 leaves untagged $jjjj$
rates in the range between a few $\times 10^7$ and $10^9$, depending on whether configuration (a) or (b) is
chosen. Since the mass window for the signal is approximately 30 GeV wide, these rates
must be increased by a factor of 3-4, to allow for a sufficient coverage of the sidebands
of the $b\bar{b}$ mass distribution, which is required to enable the interpolation of the
background rate under the Higgs mass peak. The numbers in Tables 1-4 refer to 6
years of data taking, corresponding to $6 \times 10^7$ s, distributed among the two experiments.
The result is a rate of events to tape in the range of 1 Hz (for configuration (a) with
80 GeV jet threshold) up to 50 Hz (for configuration (b) with 60 GeV jet threshold).
While a 1 Hz rate to tape is acceptable, 50 Hz would almost saturate the expected data
acquisition capability of 100 Hz. In this last case, some extra information would have to
be brought into the trigger. The best candidate is some crude $b$-tagging. If a rejection
against non-$b$ jets at the level of 20% per jet could be achieved at the trigger level, the
rates would be reduced by a factor of 20, down to perfectly acceptable levels.
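
The arithmetic behind the quoted rates to tape can be retraced as follows. This is a rough sketch: it takes the $m_H = 120$ GeV $j_b j_b jj$ entries of Tables 2 and 3, assumes a sideband factor of 3 (the lower end of the 3-4 range above), and uses the illustrative function name `rate_to_tape_hz`.

```python
def rate_to_tape_hz(n_tagged: float,
                    eps_fake: float = 0.01,
                    sideband_factor: float = 3.0,
                    live_time_s: float = 6.0e7) -> float:
    """Untagged jjjj rate to tape implied by a tabulated j_b j_b jj count."""
    n_untagged = n_tagged / eps_fake**2      # remove the two fake-tag factors
    return n_untagged * sideband_factor / live_time_s

# j_b j_b jj entries for m_H = 120 GeV
rate_low = rate_to_tape_hz(2.3e3)    # Table 3: configuration (a), 80 GeV threshold
rate_high = rate_to_tape_hz(1.1e5)   # Table 2: configuration (b), 60 GeV threshold

print(f"configuration (a), 80 GeV: about {rate_low:.1f} Hz")
print(f"configuration (b), 60 GeV: about {rate_high:.0f} Hz")
print(f"with the factor-20 trigger rejection quoted in the text: {rate_high / 20.0:.1f} Hz")
```

The two extremes land close to the 1 Hz and 50 Hz quoted in the text; the exact values depend on the assumed sideband factor.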







Figure 3: The distribution of the invariant mass of the $b\bar{b}$ system in the $b\bar{b}jj$ QCD
background (solid line), and in overlapping events of the type $(b\bar{b}) \oplus (jj)$ (dashed
line). The curves are normalised to the same cross section.

While the above processes represent the largest contribution to the backgrounds, the
smoothness of their mass distribution in the signal region allows their size to be estimated
with statistical accuracy, without significant systematic uncertainties. The situation is
potentially different in the case of the backgrounds from the tails of the $Z$ decays. The $Z$
mass peak is sufficiently close to $m_H$, especially in the case of the lowest masses allowed
by current limits, to possibly distort the spectrum and spoil the ability to accurately
reconstruct the noise level from the data. The sizes of the two possible effects (smearing
induced by the finite experimental energy resolution and the intrinsic tail of the Drell-Yan
spectrum) are given in the 4th and 5th rows of Tables 1-4. Aside from the case of the
largest $m_H$ value, where these backgrounds are anyway negligible, the dominant effect is
given by the detector resolution. For configuration (a) these backgrounds represent
a fraction of the order of at most 40% of the signal at small $m_H$, rapidly decreasing
at higher $m_H$. For configuration (b), the rates are comparable to the signal at low
$m_H$. A 10% determination of these final states, which should be easily achievable using
the $(Z \to \ell^+\ell^-)jj$ control sample and folding in the detector energy resolution for jets,
should therefore be sufficient to fix these background levels with the required accuracy.
As for the contribution of the on-peak $(Z \to b\bar{b})jj$ events to the determination of the
sideband rates, we verified that their impact is negligible. We obtain a number of the
order of 60K events with 600 fb$^{-1}$ in the mass range 83-100 GeV, for configuration (b)
and $p_T^j > 80$ GeV for the forward jets. These events can therefore be subtracted from
the sidebands with a statistical accuracy better than 1% using the measurement of the
on-peak $(Z \to \ell^+\ell^-)jj$ final states. It should be pointed out that extrapolating from the
leptonic to the $b\bar{b}$ rates with this accuracy requires a matching precision in the knowledge
of the tagging efficiencies, something which remains to be proven.

Before concluding the list of single-interaction backgrounds, we briefly comment on
the smaller contributions, $pp \to t\bar{t}$ and $pp \to t\bar{t}j$, with $t$ decaying hadronically. Before
applying the cuts, we adopt a clustering algorithm for the jets coming from the decay
of a $W$. We sum the four-momenta every time the separation between the two jets is
below the threshold $\Delta R = 0.4$. This happens quite often, since in order to have a pair of
jets in the event with an invariant mass above 1 TeV at least one of the two $W$s coming
from the $t$ decays must have a large boost. After this clustering algorithm, using the
event selection (b), about 300 $t\bar{t}j$ events survive the cuts at 600 fb$^{-1}$, while the number
of $t\bar{t}$ events is negligible. Configuration (a) leads to even smaller rates. The absolute
rate can be fixed using the data, by reconstructing the individual tops. This should be
particularly simple, since the request of large dijet mass forces the $t$ and $\bar{t}$ to be very well
separated, and the large momentum of the $W$s will reduce the combinatorial background
in the association of the $b$ jets with the $W$ jets.

3.2 Overlapping events 

We come now to the study of events due to the superposition of multiple $pp$ interactions.
The reason why these events are a potential problem is that, while production of large
dijet invariant masses in individual events is strongly suppressed energetically, these can
accidentally appear when mixing jets produced in separate events (after all, the overall
energy available in 2 collisions is twice that of a single $pp$ collision). For example, we can
consider two events, one in which a small-mass dijet pair is produced with large positive
rapidity, the other in which a low-mass pair is produced at large negative rapidity; the
pairing of jets from the two events will lead to large rapidity separations, and to large
dijet masses.

In the simplest case of two overlapping events, we have four possible combinations of
events leading to a $b\bar{b}jj$ background: $(jj) \oplus (b\bar{b})$, $(jj) \oplus (j_b j_b)$, $(j j_b) \oplus (j j_b)$ and
$(b\bar{b}) \oplus (b\bar{b})$, where $(ab) \equiv pp \to ab$. Since we do not veto on the presence of extra jets,
triple events such as $(j_1 j_b) \oplus (j j_2) \oplus (j_b j)$ are also possible. The probability of having
$n$ simultaneous events with a $jj$ final state during a bunch crossing, assuming a bunch
crossing frequency of $(25\ \mathrm{ns})^{-1}$, is given by the Poisson probability distribution function
$\pi_n(\mu)$ with average $\mu = 0.25 \times \sigma(pp \to jj)/\mathrm{mbarn} \times \mathcal{L}/\mathcal{L}_0$, where $\mathcal{L}$ is the instantaneous
luminosity and $\mathcal{L}_0 = 10^{34}$ cm$^{-2}$ s$^{-1}$.
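
The factor 0.25 in the expression for $\mu$ is simply the number of interactions per crossing for a 1 mbarn cross section at $10^{34}$ cm$^{-2}$ s$^{-1}$ with 25 ns bunch spacing. The sketch below checks this and evaluates the Poisson weights; the dijet cross section used is a hypothetical placeholder, not a number from this study.

```python
import math

MBARN_IN_CM2 = 1.0e-27   # 1 mbarn expressed in cm^2
L0 = 1.0e34              # reference luminosity in cm^-2 s^-1
BUNCH_SPACING_S = 25.0e-9

# Interactions per crossing for sigma = 1 mbarn at L0: reproduces the 0.25 in the text.
mu_per_mbarn = MBARN_IN_CM2 * L0 * BUNCH_SPACING_S

def mean_events_per_crossing(sigma_mb: float, lumi: float = L0) -> float:
    """mu = 0.25 * sigma/mbarn * L/L0."""
    return mu_per_mbarn * sigma_mb * lumi / L0

def poisson(n: int, mu: float) -> float:
    """Probability of exactly n events in a crossing with mean mu."""
    return math.exp(-mu) * mu**n / math.factorial(n)

sigma_jj_mb = 1.0e-3     # HYPOTHETICAL dijet cross section after cuts (1 microbarn)
mu = mean_events_per_crossing(sigma_jj_mb)
pi2, pi3 = poisson(2, mu), poisson(3, mu)
print(f"mu = {mu:.2e}, pi_2 = {pi2:.2e}, pi_3 = {pi3:.2e}")
```

For small $\mu$ the triple-event weight is suppressed relative to the double-event one by a factor $\mu/3$, which is why higher multiplicities can be dropped.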

To estimate the rates, we first generate a sample of unweighted events of the type
$pp \to jj$. We then randomly extract from this sample $n$-tuples of dijet events, which are
associated with events where $n$ dijet pairs from $n$ proton-proton collisions are created in the
same bunch crossing. The background can then be estimated as:

$N_{bg} = B \times (\pi_2(\mu)\,p_2 + \pi_3(\mu)\,p_3 + \ldots)$, (11)

where $B$ is the number of bunch crossings accumulated during the run time, and $p_n =$

Figure 4: The distribution of the invariant mass of the $b\bar{b}$ system in the $j_b j \oplus j_b j$
multiple-collision QCD background, for configuration (a).

$f_n/N_n$ ($n = 2, 3$), where $N_n$ is the total number of $n$-tuple events generated, and $f_2, f_3$ are
the numbers of double and triple events passing the selection cuts found in the sample
of generated events. Ellipses denote simultaneous collisions of higher order. Since $\pi_n(\mu)$
drops quite rapidly with increasing $n$, we limit our analysis to $n = 3$. The above formula
can be easily modified to include the presence of $pp \to b\bar{b}$ events. All numbers given
below refer to the case of high luminosity, namely $10^{34}$ cm$^{-2}$ s$^{-1}$. Since these rates scale
quadratically, they should be reduced by a factor of 100 in the case of $10^{33}$ cm$^{-2}$ s$^{-1}$.
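
Eq. (11) and the quadratic luminosity scaling can be illustrated with a few lines of Python. All inputs below ($\mu$, $f_2$, $N_2$) are hypothetical placeholders, and the number of bunch crossings assumes that every 25 ns slot is filled over $6 \times 10^7$ s; none of these are the values used for the tables.

```python
import math

def poisson(n: int, mu: float) -> float:
    """Probability of exactly n events in a crossing with mean mu."""
    return math.exp(-mu) * mu**n / math.factorial(n)

def n_background(bunch_crossings: float, mu: float,
                 f2: int, n2: int, f3: int = 0, n3: int = 1) -> float:
    """Eq. (11) truncated at n = 3: B * (pi_2 * p_2 + pi_3 * p_3), with p_n = f_n / N_n."""
    p2 = f2 / n2
    p3 = f3 / n3
    return bunch_crossings * (poisson(2, mu) * p2 + poisson(3, mu) * p3)

# ASSUMPTION: every 25 ns slot is a filled crossing during 6e7 s of running.
crossings = 6.0e7 / 25.0e-9

mu_high = 2.5e-4          # HYPOTHETICAL mean at 1e34 cm^-2 s^-1
mu_low = mu_high / 10.0   # the same process at 1e33 cm^-2 s^-1

n_high = n_background(crossings, mu_high, f2=12, n2=1_000_000)   # hypothetical f_2, N_2
n_low = n_background(crossings, mu_low, f2=12, n2=1_000_000)
print(f"N_bg(1e34) = {n_high:.3g}, N_bg(1e33) = {n_low:.3g}, ratio = {n_low / n_high:.4f}")
```

With the same number of crossings, lowering the luminosity by a factor of 10 reduces the double-collision yield by very nearly a factor of 100, as stated above.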

We verified that the most dangerous background comes from events of the type $(j j_b) \oplus
(j j_b)$. The main reason is as follows: since the forward, non-tagged jets are required
to have a large $p_T$ threshold (60 or 80 GeV), the fake $b$ jets in the central region will
inherit the same transverse momentum cut, as they are produced back-to-back with the
related forward jet. As a result, the invariant mass spectrum of the $j_b j_b$ pair will have
a shape peaked at about twice the cut, and therefore right in the middle of the signal
region. Typical shapes of the $m_{bb}$ spectra are given in Fig. 4, for configuration (a) (the
shapes for configuration (b) are very similar). In the case of 60 GeV, the signal regions
are right in the middle of the background peak, or on its rising slope; this makes the
background estimate very sensitive to the assumed energy resolution, both in the forward
region (since the energy scale in the forward region affects the onset of the trigger for the
forward jets, thus affecting the spectra of the central jets recoiling against them) and in
the central region (since the mass spectrum is rapidly rising in the 100-150 GeV
range). Our results were obtained by assuming a forward jet energy resolution given by
$\sigma_{fwd} = \sqrt{E} \oplus 0.07\,E$, in addition to the 12% mass resolution used earlier for the central
jets. The distributions in Fig. 4 include this resolution smearing. The rates obtained
after including the resolution effects are approximately twice as large as those obtained
with perfect resolution, stressing the importance of these effects. In absolute terms,
Tables 1-4 show that these contributions are of the same order of magnitude as the signal
when $p_T^j > 60$ GeV is used, but much smaller when the higher $p_T^j$ threshold is used. In
the former case, these final states are a potential threat, unless a way can be found to
estimate their exact size from the data. This cannot be done using the mass spectrum in
the sideband regions, since the rate is too small compared to the leading 4-jet processes.
We believe, however, that it should be possible to use the distribution of the $z$ vertex
separation between the two events as a diagnostic tool. Since the two tagged jets come
from different $pp$ events, and given that the spread of the interaction point in $z$ is of
the order of a few cm, the fraction of overlapping events where the $z$ positions of the two
vertices cannot be separated should be of the order of 10%, a number measurable by
extrapolating the $\Delta z$ distribution from large values down to the range in which $\Delta z$ is of
the order of the experimental resolution.
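
The order of magnitude of the unresolved fraction can be checked with a simple model: if the two vertices are drawn independently from a Gaussian of width $\sigma_z$, their separation is Gaussian with width $\sqrt{2}\,\sigma_z$. The sketch below assumes $\sigma_z = 5$ cm and separation thresholds of 0.5 and 1 cm; these inputs and the function name are illustrative assumptions, not detector specifications.

```python
import math

def unresolved_fraction(sigma_z_cm: float, resolution_cm: float) -> float:
    """P(|z1 - z2| < resolution) for two independent Gaussian vertex positions.

    z1 - z2 is Gaussian with standard deviation sqrt(2) * sigma_z, hence
    P = erf(resolution / (sqrt(2) * sqrt(2) * sigma_z)) = erf(resolution / (2 * sigma_z)).
    """
    return math.erf(resolution_cm / (2.0 * sigma_z_cm))

sigma_z = 5.0   # ASSUMED spread of the interaction point along z, in cm
frac_10mm = unresolved_fraction(sigma_z, 1.0)
frac_5mm = unresolved_fraction(sigma_z, 0.5)
print(f"unresolved fraction: {100.0 * frac_5mm:.1f}% (5 mm), {100.0 * frac_10mm:.1f}% (10 mm)")
```

Under these assumptions the unresolved fraction is at the level of 5-10%, consistent with the order of magnitude quoted in the text.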

Other sources of backgrounds from overlapping events are less dangerous. Events
where the $b\bar{b}$ or $j_b j_b$ pair comes from the same hard interaction ($(b\bar{b}) \oplus (jj)$ and $(j_b j_b) \oplus (jj)$)
have a smooth mass spectrum in the 100-150 GeV region, and rates smaller than those
of the single-interaction $b\bar{b}jj$ or $j_b j_b jj$ events. The mass spectrum of $(b\bar{b}) \oplus (jj)$ events is
shown in Fig. 3$^5$. Their contribution can therefore be estimated precisely from the data$^6$.
In the specific case of $m_H = 120$ GeV, for example, we obtain the following numbers
of events: $10^5$ and $4 \times 10^5$ $(jj) \oplus (b\bar{b})$ events for $p_T^j > 60$ GeV in the configurations
(a) and (b), respectively; $6 \times 10^4$ and $2 \times 10^5$ $(jj) \oplus (b\bar{b})$ events for $p_T^j > 80$ GeV in
the configurations (a) and (b), respectively. The contributions from $(jj) \oplus (j_b j_b)$ final
states are smaller by a factor of approximately 12, independently of the configuration and
transverse momentum thresholds, and assuming $\epsilon_{fake} = 0.01$.

Events of the kind $(pp \to b\bar{b}) \oplus (pp \to b\bar{b})$ turn out to be totally negligible, at the level
of 40 events with the $p_T^j > 80$ GeV cut.

The events from three separate $pp$ collisions contribute less than 10% of the
two-collision rates shown in Tables 1-4, at $10^{34}$ cm$^{-2}$ s$^{-1}$.

4 Results 

Tables 5-8 summarize our results for the sensitivity, defined as the ratio of the number of
signal events to the square root of the number of background events, for different
values of the mistagging efficiency $\epsilon_{fake}$. Tables 9 and 10 show our results on the determination
of the branching ratio $B(H \to b\bar{b})$ and accordingly on the $Hbb$ Yukawa coupling $y_{Hbb}$,

$^5$ The sharp threshold at approximately 70 GeV is due to the fact that the $b$ and $\bar{b}$ are mostly produced
back-to-back, coming from a $2 \to 2$ scattering; in the case of the single-interaction $b\bar{b}jj$ events the $b$ and
$\bar{b}$ can be produced at relative angles as small as allowed by the $\Delta R_{bb} > 0.7$ cut, and the threshold onset
is smoother.

$^6$ Of course their individual contribution may not be easily obtained; what can be estimated is the
overall rate of 4-jet events, including both double- and single-collision contributions.

Table 5: The sensitivity, defined as the ratio of the number of signal events divided
by the square root of the number of background events. The mistagging
efficiency of light jets is $\epsilon_{fake} = 0.01$. The integrated luminosity is 600 fb$^{-1}$
for both configurations (a) and (b), and the transverse momentum cut on jets is $p_T^j > 60$ GeV.

$m_H$                115 GeV   120 GeV   140 GeV
(a) $S/\sqrt{B}$     3.0       2.9       1.4
(b) $S/\sqrt{B}$     5.1       5.2       2.7

Table 6: The same as Table 5, with $p_T^j > 80$ GeV.

$m_H$                115 GeV   120 GeV   140 GeV
(a) $S/\sqrt{B}$     2.4       2.3       1.0
(b) $S/\sqrt{B}$     3.7       4.1       2.0


assuming the knowledge of the $HWW$ coupling. This can be determined using other
channels, as discussed in the literature [9]. These results rely also on the assumption of
SU(2) invariance to relate the contributions to the signal coming from the $HWW$ and
$HZZ$ couplings, which cannot be experimentally disentangled in the WBF production
mechanism. With a total luminosity of 600 fb$^{-1}$, a relative precision of about 20% on
the $B(H \to b\bar{b})$ branching ratio can be attained. This represents an improvement with
respect to what is obtained in other channels [10, 11]. As for the $Hbb$ Yukawa coupling,
a statistical accuracy of at best 30% is reachable$^7$. The accuracy is rather flat in
the 115-140 GeV mass range, as a result of the compensation between overall rate (which
decreases at larger masses) and sensitivity of the BR to the Yukawa coupling (which
increases at smaller BR, for larger masses). The effect of applying a larger cut
(80 GeV) on the transverse momentum of forward jets is to degrade by approximately 10%
the statistical accuracy of the measurement. This choice could however turn out to be
more reasonable in view of the reduced experimental difficulties at larger $p_T^j$.

The $H \to b\bar{b}$ decay in the WBF channel also allows for a model independent
determination of the ratio of widths $\Gamma(H \to b\bar{b})/\Gamma(H \to \tau^+\tau^-)$ when combined with the
$qq \to qq(H \to \tau^+\tau^-)$ mode [12]. This determination can be compared with what is obtained
in the $t\bar{t}H$ production channel by [11]. Moreover, comparing the WBF mechanism
studied in this paper with the associated $W(H \to b\bar{b})$ production, one could test the SU(2)
relation between the SM $HWW$ and $HZZ$ couplings for low Higgs masses.

$^7$ The statistical accuracy of the $b$-quark Yukawa coupling is linked to that of the branching ratio
by the following formula: $\delta y_{Hbb}/y_{Hbb} = \delta B/(2B(1 - B))$, where $B$ stands for the branching ratio $H \to b\bar{b}$.
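
The relation in footnote 7 follows from $B = \Gamma_b/(\Gamma_b + \Gamma_{other})$ with $\Gamma_b \propto y_{Hbb}^2$, which gives $\delta B/B = 2(1 - B)\,\delta y_{Hbb}/y_{Hbb}$. The sketch below checks this numerically and applies it to one example; the branching-ratio value is an illustrative placeholder rather than a number from this paper, and the relative error on $B$ is approximated by $1/(S/\sqrt{B})$, which is numerically consistent with Tables 5 and 9.

```python
def branching_ratio(y: float, gamma_other: float) -> float:
    """B = Gamma_b / (Gamma_b + Gamma_other) with Gamma_b = y**2 in arbitrary units."""
    gamma_b = y * y
    return gamma_b / (gamma_b + gamma_other)

def yukawa_rel_error(rel_br_error: float, br: float) -> float:
    """delta(y)/y = (delta(B)/B) / (2 * (1 - B)), i.e. delta(B) / (2 B (1 - B))."""
    return rel_br_error / (2.0 * (1.0 - br))

# Numerical check of the relation by shifting the coupling by a small relative step.
y0 = 1.0
br0 = 0.7                                    # ILLUSTRATIVE branching ratio
gamma_other = y0 * y0 * (1.0 - br0) / br0    # chosen so that B(y0) = br0
step = 1.0e-4
rel_br_shift = branching_ratio(y0 * (1.0 + step), gamma_other) / br0 - 1.0
recovered_step = yukawa_rel_error(rel_br_shift, br0)

# Example: S/sqrt(B) = 3.0 (Table 5, configuration (a), 115 GeV) gives delta(B)/B of about 1/3.
example = yukawa_rel_error(1.0 / 3.0, br0)
print(f"recovered step: {recovered_step:.3e}, example delta(y)/y: {example:.2f}")
```

The factor $1/(2(1 - B))$ grows as $B$ approaches 1, which is why the Yukawa coupling is determined less precisely than the branching ratio at low Higgs masses.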

Table 7: The same as Table 5 but with a mistagging efficiency of $\epsilon_{fake} = 0.05$.

$m_H$                115 GeV   120 GeV   140 GeV
(a) $S/\sqrt{B}$     2.5       2.4       1.1
(b) $S/\sqrt{B}$     4.4       4.2       2.1


Table 8: The same as Table 6 but with a mistagging efficiency of $\epsilon_{fake} = 0.05$.

$m_H$                115 GeV   120 GeV   140 GeV
(a) $S/\sqrt{B}$     2.2       2.1       1.0
(b) $S/\sqrt{B}$     3.1       3.3       1.6



5 Conclusions 

In this letter we examined $(H \to b\bar{b})jj$ production at the LHC, with the goal of assessing
the potential accuracy in the determination of the $y_{Hbb}$ Yukawa coupling. A study of the
observability of this channel has also been presented in ref. [5]. We believe our paper
provides a more realistic evaluation of the experimental challenges of this measurement,
and find less optimistic results.

In particular, we identified two main sources of backgrounds: 

• 4-jet final states: these are over 100 times larger than the signal, but could be
evaluated with accuracy using the sidebands of the $b\bar{b}$ mass spectrum. This requires
however some tagging information to be available at the trigger level, to reduce to
acceptable levels the data storage needs for inclusive, untagged, 4-jet final states.

• 4-jet final states from multiple collisions: a large contribution comes from events
of the type $(j j_b) \oplus (j j_b)$, where the $b\bar{b}$ mass spectrum has a broad peak in the
middle of the signal region. The absolute rate of these events (of the order of the
signal rate, when using the lower transverse momentum threshold of 60 GeV) can be
determined if the distribution of the $z$ vertex separation between the two overlapping
events can be measured with a resolution of the order of 5-10 mm. These events
are significantly reduced in number when using the higher threshold of 80 GeV for
the forward jets.

Our parton-level analysis should be completed with a full detector simulation, but, already
at this stage, it provides a strong indication for the relevance of this channel for the
$B(H \to b\bar{b})$ branching ratio. We have shown in fact that $B(H \to b\bar{b})$ can be measured
with a 20% precision for a Higgs mass around 120 GeV, assuming that the $HWW$
coupling is the one predicted by the Standard Model or determined in other reactions
already studied in the literature. We also observe that the WBF channel we study,
combined with other processes, can be used for a model independent determination of the

Table 9: The statistical accuracy of the determination of the branching ratio
$\Gamma_b/\Gamma$ and of the $b$-quark Yukawa coupling in the configurations (a) and (b). A
luminosity of 600 fb$^{-1}$ is assumed; the transverse momentum cut on jets is $p_T^j > 60$ GeV.
Here $\epsilon_{fake} = 0.01$. Using $\epsilon_{fake} = 0.05$ will worsen these estimates by
approximately 20%.

      $m_H$                            115 GeV   120 GeV   140 GeV
(a)   $\delta\Gamma_b/\Gamma$          0.33      0.35      0.71
      $\delta y_{Hbb}/y_{Hbb}$         0.58      0.51      0.56
(b)   $\delta\Gamma_b/\Gamma$          0.20      0.19      0.37
      $\delta y_{Hbb}/y_{Hbb}$         0.36      0.30      0.29


Table 10: The same as Table 9 with $p_T^j > 80$ GeV.

      $m_H$                            115 GeV   120 GeV   140 GeV
(a)   $\delta\Gamma_b/\Gamma$          0.42      0.43      1
      $\delta y_{Hbb}/y_{Hbb}$         0.76      0.68      0.72
(b)   $\delta\Gamma_b/\Gamma$          0.27      0.24      0.50
      $\delta y_{Hbb}/y_{Hbb}$         0.47      0.40      0.36



$y_{Hbb}/y_{H\tau\tau}$ ratio and for a test of the ratio of the couplings $g_{HWW}/g_{HZZ}$ for low Higgs
masses.

To conclude, we should point out that all statistical accuracies listed in this study
should be matched by an excellent control over experimental systematics, including the
knowledge of $b$-tagging efficiencies (needed for example to allow the determination of
$Z \to b\bar{b}$ backgrounds from the measurement of $Z \to \ell^+\ell^-$ final states) and their
dependence on the $b$ momentum, and of forward jet tagging efficiencies and fake (pile-up or
calorimeter noise) rates. On the other hand, as mentioned at the beginning, we expect our
estimates of the physics backgrounds to be very conservative, being based on very low $Q^2$
scales for the evaluation of the strong coupling constant; furthermore, we anticipate that
more sophisticated analyses based on kinematical correlations in the event (exploiting for
example the scalar nature of the $Hbb$ coupling) will help improve the signal significance.